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Abstract 

Random  variables  X and  Y are  mutually  completely  dependent  if 
there  exists  a one-to-one  function  g for  which  P[Y  = g(X)]  = 1.  An 
example  is  presented  of  a pair  of  random  variables  which  are 
mutually  completely  dependent,  but  "almost"  independent.  This 
example  motivates  considering  a new  concept  of  dependence,  called 
monotone  dependence , in  which  g above  is  now  required  to  be 
monotone.  Finally,  this  monotone  dependence  concept  leads  to 
defining  and  studying  the  properties  of  a new  numerical  measure  of 
statistical  association  between  random  variables  X and  Y defined  by 
sup  {corr  [ f (X) , g (Y) ] > , where  the  sup  is  taken  over  all  pairs  of 
suitable  monotone  functions  f and  g. 
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1.  Introduction  and  summary.  A random  variable  (rv)  Y is  defined 
(see  Lancaster  (1963))  to  be  completely  dependent  pn  a rv  X if  there 
exists  a function  g such  that 
(1.1)  P[Y  = g(X) ] = 1. 

Intuitively,  Y is  completely  dependent  on  X if  Y is  perfectly 
predictable  from  X.  The  rv's  X and  Y are  defined  (see  Lancaster  (1963)) 
to  be  mutually  completely  dependent  (MCD)  if  Y is  completely  dependent 
on  X and  X is  completely  dependent  on  Y.  Equivalently,  X and  Y are  MCD 
if  (1.1)  holds  for  some  one-to-one  function  g.  The  concept  of  mutual 
complete  dependence  is,  in  a real  sense,  directly  opposite  to  that  of 
stochastic  independence,  in  that  mutual  complete  dependence  entails 
complete  predictability  of  either  rv  from  the  otljer,  while  stochastic 

i 

independence  entails  complete  unpredictability. 

An  important  measure  of  dependence  between  two  nondegen£fate  rv's  X 
and  Y is  that  of  sup  correlation,  introduced  by  Gebelein  (1941) , studied 
among  others  by  Renyi  (1959)  and  Sarmanov  (1958a, b) , and  defined  by 

P 1 (X, Y)  = sup  p [f  (X)  ,g  (Y)  ] , 

where  the  supremum  is  taken  over  all  Borel-measurable  functions  f,  g, 
such  that  0 < Vat  f(X)  < oo  and  0 < Var  g(Y)  < oo,  and  where  p 
represents  the  ordinary  (Pearson  product  moment)  correlation.  Tbe 
properties  of  sup  correlation  as  a measure  of  dependence  are  discussed 
in  Renyi  (1959).  It  is  clear  that  two  rv's  which  are  MCD  have  sup 
correlation  1,  but  that  the  converse  is  not  true.  (See  Lancaster  (1963) 
for  a discussion  of  necessary  and  sufficient  conditions  for  the  complete 
mutual  dependence  of  random  variables.) 
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Clearly,  if  a sequence  ( (xn»  Yn) } of  pairs  of  independent  rv's 
converges  in  law  to  a pair  (X,  Y)  of  rv's,  then  X and  Y must  be 
independent.  It  might  be  conjectured  that  if  a sequence  ( (xn,  } of 
pairs  of  MCD  rv's  converges  in  law  to  a pair  (X,  Y)  of  rv's,  then  X and 
Y must  be  MCD.  As  is  shown  below,  this  conjecture  is  false.  In  fact. 
Section  2 presents  a sequence  of  pairs  of  MCD  rv's,  all  having  the  same 
marginals,  which  converges  to  a pair  of  independent  rv's.  This  defect 
of  mutual  complete  dependence  motivates  a new  concept  of  total 
statistical  dependence,  called  monotone  dependence,  which  is  defined  and 
studied  in  Section  3. 

When  two  rv's  are  neither  totally  statistical  dependent  nor  totally 
independent,  it  is  often  useful  to  have  a numerical  measure,  such  as  the 
correlation  coefficient,  to  express  the  extent  to  which  the  rv's  are 
related.  A new  numerical  measure,  called  monotone  correlation,  is 
presented  and  examined  in  Section  4.  This  new  measure  is  related  in 
Section  5 to  the  concept  of  uniform  representations  of  bivariate 
distr ibutions. 

2.  MCD  rv's  which  are  almost  independent.  This  section  presents 
sequences  {U^}  and  { V^}  of  rv's  all  having  a uniform  distribution  on 
(0,1)  such  that  for  each  n,  Un  and  Vn  are  MCD,  but  that  the  pairs  (Un, 
VR)  converge  in  law  to  a pair  (U,V)  of  independent  rv's  each  having  a 

uniform  distribution  on  (0,1). 

. . . .2 

Partition  the  unit  square  into  n congruent  squares  and  denote  by 

(i,j)  the  square  whose  upper  right  corner  is  the  point  with  coordinates 

2 2 

x * i/n,  y • j/n.  Similarly,  partition  each  of  these  n squares  into  n 
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subsquares  and  let  (i,j,p,q)  denote  subsquare  (p,q)  of  square  (i,j). 

_2 

Now  let  the  bivariate  rv  (Un,  Vn)  distribute  mass  n uniformly  on 
either  one  of  the  diagonals  of  each  of  the  n subsquares  of  the  form 
(i,j,j,i)  for  1 i i i n,  1 i j i n.  Figure  1 illustrates  the  case 
n = 3.  (INSERT  FIGURE  1 HERE) . 


* 


Figure  1.  Support  of  the  Distribution  of  (U^,V^) . 

THEOREM  1.  Each  of  the  rv's  Un,  Vn  has  a uniform  distribution  on 

(0 » 1) . For  each  n the  rv's  Un  and  Vn  are  MCD.  The  sequence  { (Un , V ) } 

converges  in  law  to  a pair  (U,V)  of  independent  uniform  rv's. 

PROOF.  For  each  n,  it  is  clear  that  U and  V are  MCD.  Also, 

n n 

since  Un  and  Vn  each  assigns  mass  n-1  uniformly  to  each  interval 

((i-l)/n,  i/n) , it  is  clear  that  U and  V have  uniform  distributions  on 

n n 

(0,1).  Finally,  since  (Un,  Vn)  assigns  total  mass  n-^  to  each  of  the  n^ 
large  squares,  limn  P[Un  <,  u,  Vn  £ v]  = uv  for  each  point  (u,v)  in  the 
unit  square.  Q 
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Now,  let  F and  G be  any  pair  of  continuous  cumulative  distribution 

functions  (cdf's).  It  is  easy  to  generate  sequences  { X^}  and  { Yn)  of 

rv's  with  respective  marginals  F and  G such  that  X and  Y are  MCD  for 

n n 

each  n,  yet  { (Xn,  Yn) } has  joint  limiting  distribution  F-G  and  hence  are 

asymptotically  independent.  To  do  this,  define  Xn  = F ^(un)  and 

Y_  = G 1 (V  ) where  U and  V are  as  above  and  where  for  any  continuous 
n n n n 

cdf  K,  we  define 

(2.1)  K ^ (t)  = inf  {x:  K(x)  £ t} . 

This  method  of  generating  bivariate  cdf's  having  specified  continuous 
marginals  from  bivariate  cdf's  having  uniform  marginals  is  the  method  of 
translation.  (See  for  example  Mardia  (1970) , Kimeldorf  and  Sampson 
(1975a) .) 

3.  Monotone  dependence.  The  preceding  example  of  pairs  of  MCD 
rv's  which  are  almost  independent  suggests  that  mutual  complete 
dependence  is  too  broad  a concept  to  be  an  antithesis  of  independence. 
We  therefore  propose  the  following  concepts  of  total  dependence. 

DEFINITION.  Let  X and  Y be  continuous  rv's.  Then  Y is  monotone 
dependent  on  X if  there  exists  a monotone  - function  g for  which 
P[Y  - g(X)J  * 1. 

It  is  easy  either  to  verify  directly  or  to  conclude  as  a corollary 
to  Theorem  2 below  that  Y is  monotone  dependent  on  X if  and  only  if  X is 
monotone  dependent  on  Y.  We  can  therefore  make  the  following 
definitions. 

DEFINITIONS.  Two  continuous  rv's  X and  Y are  monotone  dependent  if 
there  exists  a monotone  function  g for  which  P(Y  * g (X) ] = 1.  If  g is 


Page  5 


increasing,  X and  Y are  said  to  be  increasing  dependent;  if  g is 
decreasing,  X and  Y are  said  to  be  decreasing  dependent. 

Before  proceeding  to  show  that  monotone  dependent  rv's  cannot  be 
"almost"  independent  in  the  sense  described  in  Section  2,  we  review  some 
known  results  on  Frechet  bounds.  (See  Frechet  (1951).)  Let  F and  G be 
cdf's.  Then 

H+(x,y)  = min  [F(x) , G(y) ] 

and 


H (x,y)  = max  [F (x)  + G(y)  - 1,  0] 

are  called  the  upper  and  lower  Frechet  bounds,  respectively,  of  the 
class  of  bivariate  cdf's  with  marginals  F and  G.  Both  H+  and  H are 
singular  bivariate  distributions;  H+  assigns  probability  1 to  the  set 
{(x,y):-  F(x)  = G(y)}  and  H to  the  set  {(x,y):  F(x)  + G(y)  = 1).  They 

are  bounds  in  the  sense  that  if  H is  any  bivariate  cdf  with  marginals  F 
and  G,  then  , * 

(3.1)  H- (x,y)  £ H(x,y)  £H+(x,y). 


(A  proof  of  (3.1)  appears  in  Johnson  and  Kotz  (1972,  pp.  22-23).) 

THEOREM  2.  Let  X and  Y be  continuous  rv's  with  respective  cdf's  F 
and  G.  A necessary  and  sufficient  condition  that  X and  Y be  increasing 
(decreasing)  monotone  dependent  is  that  the  joint  cdf  of  (X,Y)  is 
H+  0T) . 

PROOF.  The  sufficiency  is  immediate.  To  prove  the  necessity, 
assume  that  X and  Y are  increasing  monotone  dependent,  so  that  (1.1) 
holds  for  some  monotone  increasing  g.  If  s i t,  then 
(3.2)  F(t)  - F (s)  S P[g(s)  < g(X)  S g(t)  ] + P[g(X)  = g(s)l 


♦ 


I 
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= P [g (s)  < Y * q(t)]  + P[Y  = g(s)  ] 

= P [g  (s)  < Y < g (t)  ] 

^ G(g(t) ) - G(g(s) ) . 

Let  t oo  and  s 4 -oo  in  (3.2)  to  derive  1 £ G(g(oo))  - G(g(-co))  and 
hence  G(g(oo))  = 1 and  G(g(-oo))  = 0.  Let  s * -oo  and  set  t=x  in  (3.2) 
to  derive 

(3.3)  F(x)  ^ G(g(x) ) . 

Let  s=x  and  t •>  oo  in  (3.3)  to  derive  the  inequality 

(3.4)  1 - F (x)  1 1 - G(g(x) ) , 

which,  together  with  (3.3),  implies  that  F(x)  = G(g(x)).  Now,  if  H is 
the  joint  cdf  of  (X,Y),  then 

H(x,y)  = P[X  i *,  Y i y] 

= P [F (X)  ^ F (x) , G(Y)  £ G (y) ] 

= P [G (g (X) ) * F (x)  , G(Y)  * G(y)] 

= P (G  (Y)  £ F (x)  , G (Y)  * G(y)  ] 

= P (G (Y)  £ min  {F(x),  G(y)}] 

= min  (F(x) , G(y) }. 

A similar  argument  is  used  if  g is  decreasing.  0 

Theorem  2 is  a partial  justification  for  the  interpretation  of 
monotone  dependence  as  an  opposite  to  stochastic  independence.  Tbe 
theorem  implies  that  among  all  pairs  of  rv's  with  prescribed  marginals, 
those  which  are  as  dependent  as  possible  in  the  sense  of  (3.1)  are 
exactly  those  which  are  monotone  dependent.  Section  2 presented  a 
sequence  of  pairs  of  MCD  continuous  rv's  which  converges  in  law  to  a 
pair  of  independent  rv's.  The  following  theorem  shows  that  this  cannot 
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happen  for  pairs  of  monotone  dependent  continuous  rv's  by  showing  that 

the  property  of  monotone  dependence  is  preserved  under  weak  convergence. 

THEOREM  3.  Lf  { (Xn,  Yn)  } is  a sequence  of  pairs  of  monotone 

dependent  continuous  rv's  which  converge  in  law  to  a pair  (X,Y)  of 

continuous  rv's,  then  X and  Y are  monotone  dependent. 

PROOF.  Denote  by  and  H the  respective  bivariate  cdf's  of 

(Xn,Yn)  and  (X,Y) , and  denote  by  Fn,  Gn,  F,  and  G the  cdf's  of  Xn,  Yn, 

X,  and  Y,  respectively.  Since  {(Xn,Yn)}  converges  in  law  to  (X,Y) , it 

follows  that  {Fn (x) } converges  to  F(x) , {Gn (y) } converges  to  G(y)  and 

there  exists  a subsequence  { (X  ,Y  )}  such  that  either  X and  Y are 

nk  nk  nk  nk 

increasing  monotone  dependent  for  all  k or  decreasing  monotone  dependent 

for  all  k.  It  follows  in  the  former  case  by  Theorem  2 that  H (x,y)  = 

k 

min  {F  (x)  , G (y) } , which  converges  to  H(x,y)  = min  {F(x),  G (y) } . 
nk  nk 

Therefore,  X and  Y are  increasing  monotone  dependent.  A similar 

argument  holds  if  X and  Yn  are  decreasing  monotone  dependent  for  each 
nk  nk 

k.  D 

4.  Monotone  correlation.  Two  continuous  rv's  X and  Y are  monotone 
dependent  if  there  exists  a perfect  monotone  relation  between  than.  If 
the  rv's  are  not  perfectly  monotonically  related,  it  may  be  useful  to 
measure  nunerically  the  degree  of  monotone  dependence  between  than.  One 
such  measure,  called  monotone  correlation,  can  be  defined  as  follows: 

DEFINITION.  Hie  monotone  correlation  p*  between  two  nondegenerate 
rv's  X and  Y is 

(4.1)  p* (X, Y)  = sup  p[f (X) , g(Y) ] , 

where  the  supremm  is  taken  ovet  all  monotone  functions  f,  g,  for  which 
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0 < var  f (X)  < oo  and  0 < var  g(Y)  < go. 

It  is  cleat  that  if  two  tv's  are  monotone  dependent,  then  their 
monotone  correlation  is  1.  To  see  that  the  converse  implication  fails, 
let  (X,Y)  have  a uniform  distribution  over  the  region  [(0,1)  X (0,1)]  U 
[(1,2)  X (1,2) 1 so  that  X and  Y ate  not  monotone  dependent,  although 
p*  (X, Y)  2p[I^q  ^ (X) , I^q  -^(Y)]  = where  I denotes  the  indicator 
function.  It  is  obvious  that 

P(X,Y)  1 p*  (X, Y)  £ p'  (X,Y) . 

It  can  be  easily  seen  that  p*  is  not  identical  to  p'.  For  example, 
let  (X, Y)  have  a uniform  distribution  on  the  region  [(0,1)  X (0,1)]  U 
[(0,1)  X (2,3)]  U [(1,2)  X (1,2)]  U [(2,3)  X (0,1)]  U [(2,3)  X (2,3)] 
and  let  f = 1)  + 1 (2  3) ' so  < * • butp'(X,Y)  2 

p[f (X) ,f (Y) ] = 1. 

While  correlation  as  a measure  of  dependence  is  invariant  under 
changes  of  scale  and  location  in  X and  Y,  monotone  correlation  is 
invariant  under  all  order -preserving  or  order-reversing  transformations 
of  X and  Y.  Thus,  monotone  correlation  would  be  a suitable  measure  of 
association  for  ordinal  data.  For  a further  discussion  of  measures  of 
association  for  ordinal  data,  the  reader  is  referred  to  Kruskal  (1958) 
and  Gibbons  (1971,  Chap.  12). 

Any  candidate  for  a measure  of  association  should  have  the  property 
of  being  zero  when  the  rv's  are  independent.  Clearly,  correlation,  sup 
correlation,  and  monotone  correlation  all  have  this  property.  It  would 
also  be  desirable  for  a measure  of  association  to  satisfy  the  converse 
implication,  namely  that  it  be  zero  only  when  X and  Y are  independent. 
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Correlation  cleat ly  does  not  satisfy  this  convetse  property,  although 
sup  correlation  does.  (See  Renyi  (1959)).  The  following  theorem  shows 
that  monotone  correlation  satisfies  this  converse  implication.  Hie 
proof  of  the  theorem  is  essentially  similar  to  that  given  by  Renyi  for 
sup  correlation. 

THEOREM  4.  If.  X and  Y are  nondegenerate  rv's  with  monotone 
correlation  zero,  then  X and  Y are  independent. 

PROOF.  Suppose  p* (X,Y)  = 0.  For  any  real  t,  define  f = I,  . . 
We  claim  that  p[fg (X) ,ft(Y) ] = 0.  For  if  not,  then  either 

(D(fs(X)  ,gfc(Y)  ] > 0 or  p[fs(X)  ,-gfc(Y)  ] > 0,  which  contradicts  the 

hypothesis.  Now,  p[fg(X) ,ft(Y)]  = 0 implies  that  P[X  £ s,  Y £ t]  = 

P[X  <;  s]-P[Y  ^ t] , which  implies  independence.  □ 

5.  Uniform  representation  and  monotone  correlation.  Let  H be  a 
continuous  bivariate  cdf  with  marginal  cdf's  F and  G.  The  uniform 
representation  UH  of  H as  defined  by  Kimeldorf  and  Sampson  (1975b)  is 

(5.1)  UH(u,v)  = H(F_1(u) ,G_1(v)) , 0 < u £ 1,  0*v£l, 

where  F * and  G ^ are  as  defined  by  (2.1).  Observe  that  UH  is  a cdf  on 

* 

the  unit  square  with  both  marginal  distributions  being  uniform  on  (0,1). 
Thus,  the  class  of  all  continuous  bivariate  cdf's  can  be  decomposed  into 
equivalence  classes  determined  by  the  equivalence  relation 

(5.2)  Hj  - H2  iff  UH  = UH  . 

If  (X,Y)  and  (V,W)  have  continuous  cdfs  H and  K,  respectively,  we  write 
(X,Y)  - (V,W)  whenever  H - K. 
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LEMMA  1.  Let  (X,Y)  and  (V,W)  have  continuous  bivariate  cdf 's  H and 
K,  respectively.  Then  H - K (i.e.  (X,Y)  - (V,W) ) ^f  and  only  if  there 

exist  increasing  functions  A and  B such  that  the  joint  cdf  of  A(X)  and 
B(Y)  is  K. 

PROOF.  Denote  the  marginals  of  H by  Fx  and  Fy  and  the  marginals  of 
K by  Fy  and  Fw>  Suppose  there  exist  increasing  functions  A and  B such 
that  (A(X) ,B(Y) ) has  continuous  cdf  K.  Since  A(X)  is  a continuous  rv, 
the  marginal  cdf  of  A(X)  is  F(s)  = P [A(X)  £ s]  = P [X  ^ A 1 (s)  ] = 
(F^oa  ^) (s)  , where  A ^ is  as  defined  by  (2.1).  Similarly,  the  marginal 
cdf  of  B(Y)  is  (FyOB  *) (t) . Therefore,  the  uniform  representation  of  K 
is 

UK(s,t)  = K[(A-1)_1OFx-1(s)  , (B'Wy'V)] 

= P [A (X)  * (A-1)_1OFx_1(s),  B(Y)  * (B'1)'1OFy(t)] 

= P [X  ^ Fx_1(s),  Y ^ Fy_1(t)] 

= H(FX_1(S),  Fy_1(t)), 

which  is  the  uniform  representation  of  H. 

Conversely,  suppose  H - K.  Let  A = Fv_1ofx  and  B = Fw_1OFy.  Then 
A(X)  has  cdf  Fy  and  B(Y)  has  cdf  Fw»  Moreover,  by  (5.1)  the  uniform 
representation  of  the  joint  cdf  of  (A(X) , B(Y) ) is 
U(s,t)  = P[A(X)  * Fy_1(s),  B(Y)  * Fw-1  (t)  ] 

= P[FX(X)  S s,  Fy (Y)  * t] 

= P[X  * Fx_1(s),  Y * Fy_1(t)], 

which  is  the  uniform  representation  of  H,  hence  of  K.  Finally,  since 
the  joint  cdf  of  (A(X),  B(Y))  has  the  same  marginals  as  K and  also  the 
same  uniform  representation  as  K,  the  joint  cdf  is  K.  0 
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An  elementary  relationship  between  the  concepts  of  uniform 
representation  and  monotone  correlaion  is  that 

(X, Y)  - (V,W)  implies  p*(X,Y)  = p*(V,W). 

This  relationship  follows  directly  from  Lemma  1.  A further  relationship 
between  the  concepts  of  uniform  representation  and  monotone  correlation 
is  expressed  by  the  following  theorem,  whose  proof  requires  an 

additional  lemma. 

THEOREM  5.  Let  (X,Y)  have  continuous  bivar iate  cdf  H.  Then 
(5.3)  p*(X,Y)  = sup  { |p(V,W)  I : (V,W)  ~ (X,Y)}. 

LEMMA  2.  Given  nondegenerate  rv's  X and  Y,  let  f and  g be 
increasing  functions  for  which  0 < var  f(X)  < oo  and  0 < var  g(Y)  < co. 
Then  there  exist  sequences  { fn>  and  (gn)  of  str ictly  increasing 
functions  for  which  var  fn(X)  < oo,  vat  9n00  < 00 ' and  limn  p(fn(X) , 
gn(Y))  = p[f(X),  g(Y) ] . 

PROOF.  We  use  the  fact  that  any  increasing  function  can  be 
uniformly  approximated  by  a strictly  increasing  function.  Let  (fn)  and 
{g^}  be  sequences  of  strictly  increasing  functions  converging  uniformly 
to  f and  g,  respectively.  Since  f(X)  and  g(Y)  have  finite  nonzero 
variances,  we  have  E[fn(X)]  »E[f(X)],  E [gn (Y) ] *E[g(Y)],  E[fn2(X)]  * 

E [f2 (X) ) , E(gn2(Y)]  E [g2 (Y) ] , and  E[fn(X)gn(Y) ) ■>  E [f  (X)g(Y) ] , so  that 
p(fn(X),  gn(Y)]  *p[f(X),  g(Y)].  0 

PROOF  OF  THEOREM.  Let  any  number  G > 0 be  given.  By  the 

definition  of  p*  and  Lemma  2,  there  exist  strictly  increasing  functions 
f and  g such  that  lp*(X,Y)  - lp[f(X),  g(Y)]|  I < G.  Thus,  the  pair 


f 
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(V=f (X) , W=g(Y))  has  a continuous  joint  cdf  and  Lemma  1 can  be  applied 
to  conclude  that  (V,W)  - (X,Y).  Hence  the  left  side  of  (5.3)  cannot 
exceed  the  right  side.  To  prove  the  reverse  inequality,  suppose  (V,W)  - 
(X,Y) . Then  by  Lemma  1,  there  exist  increasing  functions  A and  B for 
which  p [A  (X)  , B(Y)  ] = p (V,W)  . Let  A'  = [sgn  p (V,W)  ] • A so  that  l,o(V,W)  I 
- o(A’(X),  B(Y)  ] * p*  (X, Y) . □ 

If  X and  Y are  univariate  rv's  with  respective  cdf's  F and  G,  then 
the  grade  correlation  (see,  for  example  Gibbons  (1971)),  which  is  the 
population  analog  of  Spearman's  rank  correlation  coefficient,  is  defined 
as  Og  = iO [F (X) , G(Y) ] . Thus,  the  grade  correlation  is  the  (ordinary) 
correlation  coefficient  of  the  uniform  representation,  and 
Pg(X,Y)  ^p*(X,Y)  **>'(X,Y). 

Note  that  the  probability  integral  transform  can  be  used  to  standardize 
an  ordinal  scale  by  divising  ranges  that  are  equal  in  terms  of 

probability.  In  computing  relationships  between  two  such  ordinal 
variables,  therefore,  the  rank  correlation  (or  grade  correlation)  is  a 
useful  device.  However,  it  might  be  argued  that  the  scaling  should  be 
done  in  an  absolute  fashion,  rather  than  relative  to  some  sort  of 

population  distribution.  What  the  monotone  correlation  measures  is  the 
maximal  correlation  that  might  be  achieved  under  any  such  monotone 
scaling. 

A continuous  bivariate  distribution  can  be  decomposed  into  two 

components:  its  structure,  by  which  is  meant  the  equivalence  class 

determined  by  the  equivalence  relation  - (defined  by  (5.2))  in  which  the 
distribution  belongs,  and  its  marginal  distributions.  Conversely,  given 


Page  13 


any  equivalence  class  and  any  pair  of  continuous  univariate 

distributions,  there  exists  a unique  bivariate  distribution  with  these 
two  components.  In  this  context,  Whitt  (1976)  posed  the  following 
problem:  If  the  marginals  are  fixed,  for  what  structure  is  the 

correlation  maximized?  Whitt  showed  that  the  maximum  correlation  is 
achieved  vdien  the  bivariate  distribution  is  the  upper  Frechet  bound,  and 
the  correlation  is  minimized  vhen  the  distribution  is  the  lower  Frechet 
bound. 

One  can  just  as  well  pose  the  reverse  problem:  If  the  structure  is 

fixed,  for  what  pair  of  marginals  is  the  correlation  maximized?  In 
general,  there  will  not  be  any  pair  of  marginals  for  which  the  maximum 
is  achieved;  on  the  other  hand.  Theorem  5 states  that  the  supremum  of 
the  correlations  is  exactly  the  monotone  correlation. 
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